Search CORE

4 research outputs found

Biomimetic multi-resolution analysis for robust speaker recognition

Author: C Schreiner
D Garcia-Romero
D Garcia-Romero
D Zotkin
Dmitry N Zotkin
H Beigi
H Hermansky
H Hirsch
H Steeneken
H Versnel
J Woojay
JS Garofolo
K O’Connor
K Wang
L Miller
M Elhilali
Mounya Elhilali
P Kenny
P Loizou
Q Wu
R Auckenthaler
R Drullman
Ramani Duraiswami
S Greenberg
S Greenberg
Sridhar Krishna Nemala
T Arai
T Cover
T Elliott
T Kinnunen
X Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A Multistream Feature Framework Based on Bandpass Modulation Filtering for Robust Speech Recognition

Author: Kailash Patil
Mounya Elhilali
Sridhar Krishna Nemala
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Biomimetic multi-resolution analysis for robust speaker recognition

Author: Dmitry N. Zotkin
Mounya Elhilali
Ramani Duraiswami
Sridhar Krishna Nemala
Publication venue
Publication date: 01/01/2012
Field of study

Humans exhibit a remarkable ability to reliably classify sound sources in the environment even in presence of high levels of noise. In contrast, most engineering systems suffer a drastic drop in performance when speech signals are corrupted with channel or background distortions. Our brains are equipped with elaborate machinery for speech analysis and feature extraction, understanding of which would presumably improve the performance of automatic speech processing systems under adverse conditions. The work presented here explores a biologically-motivated multi-resolution speaker information representation obtained by performing an intricate yet computationally-efficient analysis of the information-rich spectro-temporal attributes of the speech signal. We evaluate the proposed features in a speaker verification task performed on NIST SRE 2010 data. The biomimetic approach yields significant robustness in presence of non-stationary noise and reverberation, offering a new framework for deriving reliable features for speaker recognition and speech processing

CiteSeerX

Springer - Publisher Connector